Abstract

For the Lego discrepancy with $M$ bins, which is equivalent to a $\chi^2$-statistic
with $M$ bins, we present a procedure to calculate the moment generating function of
the probability distribution perturbatively if $M$ and $N$, the number of uniformly and
randomly distributed data points, become large. Furthermore, we present a phase
diagram for various limits of the probability distribution in terms of the standardized
variable if $M$ and $N$ become infinite.
1 Introduction



The importance of the notion of uniformity of point sets in the problem of numerical
integration has been stressed in many publications (see e.g. [?]). In general, a
discrepancy is a measure of non-uniformity of point sets in an integration region. For a
set of random points it is a function of random variables. For the Lego discrepancy,
the integration region is considered to be divided into a number of, say $M$, bins, so that
it becomes a function of $M$ random variables, namely the number of points in each of the
bins. Furthermore, the Lego discrepancy is defined such that it is exactly the $\chi^2$-statistic
of the binned data points.

In [?] we have given criteria for the asymptotic probability distribution of various
quadratic discrepancies to become Gaussian when a certain free parameter becomes
infinitely large. This parameter often is the dimension $s$ of the integration region. In the
case of the Lego discrepancy, it is the number of bins $M$. In [?] it is shown that for
the Fourier diaphony a Gaussian limit is obtained when both $N$ and $s$ go to infinity such
that $c^s/N \to 0$, where $c$ is some constant larger than 1. This theorem clearly gives more
information about the behavior of the probability distribution than the statements of [?],
for it relates $s$ and $N$, whereas in [?] the limit of $N \to \infty$ is assumed before considering
the behavior with respect to $s$ or $M$.

In [?] and [?], we introduced techniques to calculate the $1/N$ expansion of the moment
generating function of the probability distribution of quadratic discrepancies, and applied
them to the $L_2$-discrepancy, the Fourier diaphony (both for $s = 1$), and the Lego discrepancy.
Such an expansion can be used to calculate corrections to the asymptotic distribution for
$N \to \infty$, but, presented as in [?], it cannot give information about limits if $M$ or $s$ become
infinite also. In this paper we use the mentioned technique to calculate limits for the Lego
discrepancy if $M$ as well as $N$ become infinite.

First, we will show that the natural expansion parameter in the calculation of the
moment generating function is $M/N$, and calculate a few terms. We will see, however,
that a strict limit of $M \to \infty$ does not exist, and, in fact, this is well known because
the $\chi^2$-distribution, which gives the lowest order term in this expansion, does not exist if
the number of degrees of freedom becomes infinite. We overcome this problem by going
over to the standardized variable, which is obtained from the discrepancy by shifting and
rescaling it such that it has zero expectation and unit variance. In fact, it is this variable
for which the results in [?] and [?] were obtained. In this paper, we derive similar results
for the Lego discrepancy, depending on the behavior of the sizes of the bins if $M$ goes to
infinity. We will see that various asymptotic probability distributions occur if $M, N \to \infty$
such that $M^{\alpha}/N \to$ constant with $\alpha \geq 0$. If, for example, the bins become asymptotically
equal and $\alpha > \frac{1}{2}$, then the probability distribution becomes Gaussian. Notice that this
includes limits with $\alpha < 1$, which is in stark contrast with the rule of thumb that, in order
to trust the $\chi^2$-distribution, each bin has to contain at least a few, say five (see e.g. [?]),
data points. Our result states that, for large $M$ and $N$, the majority of bins is allowed to
remain empty!
2 The $\chi^2$-statistic as a discrepancy



The $\chi^2$-statistic for $N$ data points distributed over $M$ bins with expected number of points
$w_n N$ for bin $n = 1, \ldots, M$ is given by

$$\chi^2 = \sum_{n=1}^{M} \frac{(\mathcal{N}_n - w_n N)^2}{w_n N} , \tag{1}$$

where $\mathcal{N}_n$ is the number of points in bin $n$. If the data points are distributed truly randomly,
i.e., if they are independent and if the probability for a point to fall in bin $n$ is equal to
$w_n$, then the statistic is distributed according to a multinomial distribution:

$$\mathsf{P}[\chi^2 < t] = \sum_{\{\mathcal{N}_m\}} \theta(t - \chi^2)\, N! \prod_{m=1}^{M} \frac{w_m^{\mathcal{N}_m}}{\mathcal{N}_m!} , \tag{2}$$

where $\theta$ is the step function and the sum is over all configurations $\{\mathcal{N}_m\}$ with
$\sum_{m=1}^{M} \mathcal{N}_m = N$. In the limit of
an infinite number of data points, this becomes the $\chi^2$-distribution, which has moment
generating function

$$G_0(z) = \frac{1}{(1 - 2z)^{\frac{M-1}{2}}} . \tag{3}$$

A first step in the interpretation of this statistic as a quadratic discrepancy is the insight
that it is quadratic in the variables $\mathcal{N}_n$ and can be characterized by a matrix with elements

$$B_{n,m} = \frac{\delta_{n,m}}{w_n} - 1 , \tag{4}$$

so that $\chi^2 = \frac{1}{N}\sum_{n,m=1}^{M} \mathcal{N}_n B_{n,m} \mathcal{N}_m$. Furthermore, we assume that the bins are disjoint
subsets of an integration region, and that the data points $x_k$, $1 \leq k \leq N$, are in this
integration region. The measure of subset $n$ is then equal to $w_n$. The union of the subsets
we call $K$; it has measure

$$\sum_{n=1}^{M} w_n = 1 . \tag{5}$$

The number of points $\mathcal{N}_n$ is equal to $\sum_{k=1}^{N} \vartheta_n(x_k)$, where $\vartheta_n$ is the characteristic function
of bin $n$, so that

$$\chi^2 = \frac{1}{N}\sum_{k,l=1}^{N}\,\sum_{n,m=1}^{M} \vartheta_n(x_k)\, B_{n,m}\, \vartheta_m(x_l) = \frac{1}{N}\sum_{k,l=1}^{N} B(x_k, x_l) = D_N . \tag{6}$$

The interpretation as a quadratic discrepancy in the definition following [?] is possible,
because the two-point function $B$ as obtained above integrates to zero with respect to each
of its variables. From now on, we will call $D_N$ the Lego discrepancy.
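
The equivalence between the binned statistic of Eq. (1) and the quadratic form built from the matrix of Eq. (4) can be checked on a small example. The following is only a sketch: the function names, the three bin measures and the counts are illustrative choices, not taken from the paper, and exact rational arithmetic is used so that the comparison is free of rounding.

```python
from fractions import Fraction

def chi2_direct(counts, w):
    """Binned statistic: sum_n (N_n - w_n N)^2 / (w_n N)."""
    N = sum(counts)
    return sum((c - wn * N) ** 2 / (wn * N) for c, wn in zip(counts, w))

def lego_matrix(w):
    """Matrix with elements B[n][m] = delta_{n,m} / w_n - 1."""
    M = len(w)
    return [[(1 / w[n] if n == m else 0) - 1 for m in range(M)] for n in range(M)]

def chi2_quadratic(counts, w):
    """Quadratic form (1/N) sum_{n,m} N_n B_{n,m} N_m."""
    N = sum(counts)
    B = lego_matrix(w)
    M = len(w)
    return sum(counts[n] * B[n][m] * counts[m]
               for n in range(M) for m in range(M)) / N

# Hypothetical example: three bins with measures summing to one, N = 10 points.
w = [Fraction(1, 2), Fraction(3, 10), Fraction(1, 5)]
counts = [4, 5, 1]

print(chi2_direct(counts, w), chi2_quadratic(counts, w))
# Each row of B, weighted with the bin measures, sums to zero; this is the
# discrete counterpart of "B integrates to zero with respect to each variable".
print([sum(B_nm * wm for B_nm, wm in zip(row, w)) for row in lego_matrix(w)])
```

Both expressions are evaluated exactly, so any difference between them would indicate a mistake in the matrix rather than floating-point noise.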
3 Sequences and notation

In the following, we will investigate limits in which the number of bins M goes to infinity. 
Note that for each value of $M$, we have to decide on the values of the measures $w_n$.
They clearly have to scale with $M$, because their sum has to be equal to one. There
are, of course, many possible ways for the measures to scale, i.e., many double-sequences
$\{w_n^{(M)},\ 1 \leq n \leq M,\ M > 0\}$ of positive numbers with

$$\sum_{n=1}^{M} w_n^{(M)} = 1 \quad \forall M > 0 \qquad \text{and} \qquad \lim_{M\to\infty} \sum_{n=1}^{M} w_n^{(M)} = 1 . \tag{7}$$

We, however, want to restrict ourselves to discrepancies in which the relative sizes of the
bins stay of the same order, i.e., sequences for which

$$\inf_{n,M} M w_n^{(M)} \in (0, \infty) \qquad \text{and} \qquad \sup_{n,M} M w_n^{(M)} \in (0, \infty) . \tag{8}$$

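It will appear to be appropriate to specify the sequences under consideration by another
criterion, which is for example satisfied by the sequences mentioned above. It can be
formulated in terms of the objects

$$M_p = \sum_{n=1}^{M} \left(\frac{1}{w_n^{(M)}}\right)^{p-1} , \qquad p \geq 1 , \tag{9}$$

and is given by the demand that

$$h_p = \lim_{M\to\infty} \frac{M_p}{M^p} \in [1, \infty) \qquad \forall p \geq 1 . \tag{10}$$

Within the set of sequences we consider, there are those for which the bins become
asymptotically equal, i.e., sequences with

$$w_n^{(M)} = \frac{1 + \varepsilon_n^{(M)}}{M} \quad \text{with} \quad \varepsilon_n^{(M)} > -1 , \ 1 \leq n \leq M \quad \text{and} \quad \lim_{M\to\infty} \max_{1 \leq n \leq M} |\varepsilon_n^{(M)}| = 0 . \tag{11}$$

They belong to the set of sequences with $h_p = 1$ $\forall p > 1$, which will allow for special
asymptotic probability distributions.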
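
To make the role of these objects concrete, the sketch below evaluates the ratio $M_p / M^p$ for two families of bin measures. It assumes the definition $M_p = \sum_n (1/w_n)^{p-1}$ and that $h_p$ is the large-$M$ limit of this ratio; the two-size family is a hypothetical example chosen only because its ratio is independent of $M$.

```python
from fractions import Fraction

def moment_object(w, p):
    """Assumed definition: M_p = sum_n (1 / w_n)^(p - 1)."""
    return sum((1 / wn) ** (p - 1) for wn in w)

def ratio(w, p):
    """M_p / M^p, whose large-M limit is taken to be h_p."""
    return moment_object(w, p) / Fraction(len(w)) ** p

def equal_bins(M):
    """All bins have measure 1/M."""
    return [Fraction(1, M)] * M

def two_size_bins(M):
    """Hypothetical sequence (M even): half the bins have measure 2/(3M),
    the other half 4/(3M), so that the measures sum to one."""
    half = M // 2
    return [Fraction(2, 3 * M)] * half + [Fraction(4, 3 * M)] * half

for M in (4, 8, 16):
    print(M, ratio(equal_bins(M), 2), ratio(two_size_bins(M), 2))
```

For equal bins the ratio is exactly one for every $p$, while the two-size family keeps the relative bin sizes of the same order but gives a ratio larger than one for $p = 2$, so it would not belong to the special class with $h_p = 1$.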

In the following analysis, we will consider functions of $M$ and their behavior if $M \to \infty$.
To specify relative behaviors, we will use the symbols "$\sim$", "$\asymp$" and "$\prec$". The first one is
used as follows:

$$f_1(M) \sim f_2(M) \iff \lim_{M\to\infty} \frac{f_1(M)}{f_2(M)} = 1 . \tag{12}$$

If a limit as above is not necessarily equal to one and not equal to zero, then we use the
second symbol:

$$f_1(M) \asymp f_2(M) \iff f_1(M) \sim c\, f_2(M) , \quad c \in (0, \infty) . \tag{13}$$

We only use this symbol for those cases in which $c \neq 0$. For the cases in which $c = 0$ we
use the third symbol:

$$f_1(M) \prec f_2(M) \iff \lim_{M\to\infty} \frac{f_1(M)}{f_2(M)} = 0 . \tag{14}$$

We will also use the $O$-symbol, and do this in the usual sense. We can immediately use
the symbols to specify the behavior of $M_p$ with $M$, for the criterion of Eq. (10) tells us that

$$M_p \asymp M^p , \tag{15}$$

and that

$$M_p \sim M^p \quad \text{if} \quad h_p = 1 . \tag{16}$$

In our formulation, also the number of data points $N$ runs with $M$. We will, however,
never denote the dependence of $N$ on $M$ explicitly and assume that it is clear from now
on. Also the upper index at the measures $w_n$ we will omit from now on.



4 Feynman rules 



In [?] and [?] we have shown that the moment generating function $G : z \mapsto \mathsf{E}[e^{z D_N}]$ of
the probability distribution of the Lego discrepancy can be written as an integral over $M$
ordinary degrees of freedom (the bosons) and $2N$ Grassmannian degrees of freedom (the
fermions). The integral is not unique and contains a gauge freedom. Furthermore, it
introduces a natural expansion parameter

$$g = \sqrt{\frac{2z}{N}} \tag{17}$$

to calculate the generating function perturbatively. In one particular gauge, the Landau
gauge, the terms are equal to the contributions of Feynman diagrams, which can be calculated
according to the following rules:



boson propagator:     $n$ ------- $m$  $= B_{n,m}$ ;                                          rule 1
fermion propagator:   $i$ --->--- $j$  $= \delta_{i,j}$ ;                                     rule 2
vertices:             [fermion line with $p$ boson legs]  $= -g^p \times$ convolution , $p \geq 2$ .   rule 3

To calculate a term in the expansion series, the contribution of all vacuum diagrams, i.e. all
diagrams without external legs, carrying the same power of $g$ has to be taken into account.
In the Landau gauge, only connected diagrams have to be calculated, for

$\log G(z)$ = the sum of the connected vacuum diagrams.                                        rule 4
In the vertices, boson propagators are convoluted as $\sum_{m=1}^{M} w_m B_{m,n_1} B_{m,n_2} \cdots B_{m,n_p}$, fermion
propagators as $\sum_{j=1}^{N} \delta_{i_1,j}\,\delta_{j,i_2}$, and then these convolutions are multiplied. As a result of
this, the bosonic part of each diagram decouples completely from the fermionic part. The
contribution of the fermionic part can easily be determined, for

every fermion loop only gives a factor $-N$.                                                   rule 5



4.1 Rules for the bosonic parts 

Because of the rather simple expression for the bosonic propagator, we are able to 
deduce from the basic Feynman rules some effective rules for the bosonic parts of the 
Feynman diagrams. Remember that the bosonic parts decouple completely from the
fermionic parts. The following rules apply after having counted the number of fermion loops
and the powers of g coming from the vertices, and after having calculated the symmetry 
factor of the original diagram. When we mention the contribution of a diagram in this 
section, we refer to the contribution apart from the powers of g and symmetry factors. 
This contribution will be represented by the same drawing as the diagram itself. 
The first rule is a consequence of the fact that

$$\sum_{n=1}^{M} B_{n_1,n}\, w_n\, B_{n,n_2} = B_{n_1,n_2} , \tag{18}$$

and states that all vertices with only two legs that do not form a single loop can be removed.
The second rule is a consequence of the fact that for any $M \times M$-matrix $f$

$$\sum_{n,m=1}^{M} w_n w_m f_{n,m} B_{n,m} = \sum_{n=1}^{M} w_n f_{n,n} - \sum_{n,m=1}^{M} w_n w_m f_{n,m} , \tag{19}$$

and states that the contribution of a diagram is the same as that of the diagram in which a
boson line is contracted and the two vertices, connected to that line, are fused together to
form one vertex, minus the contribution of the diagram in which the line is simply removed
and the vertices replaced by vertices with one boson leg less. This rule can be depicted as
follows:

[diagram with a boson line between two vertices] = [same diagram with the line contracted and the two vertices fused] $-$ [same diagram with the line removed]        rule 6

By repeated application of these rules, we see that the contribution of a connected bosonic
diagram is equal to the contribution of a sum of products of so-called daisy diagrams (see
footnote 1), which are of the type

[diagram: a single vertex with $p$ boson loops ("leaves") attached] . (20)

Footnote 1: [an example of the reduction of a bosonic diagram to daisy diagrams by these rules; the diagrams are not recoverable from the source]

They are characterized by the fact that all lines begin and end on the same vertex and
form single loops. The contribution of such a diagram is given by

$$d_p(M) = \sum_{n=1}^{M} w_n B_{n,n}^{\,p} = \sum_{q=0}^{p} \binom{p}{q} (-1)^q M_{p-q} = M_p\,[1 + O(M^{-1})] , \tag{21}$$

where the last equation follows from Eq. (15). The maximal number of leaves that a product
in the sum of daisy diagrams contains is equal to the number of loops in the original
diagram, so that

the contribution of a diagram with $L_B$ boson loops is $M_{L_B}[1 + O(M^{-1})]$.                 rule 7

The leading order contribution of a diagram with $L_B$ boson loops is thus of the order of $M^{L_B}$.
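
The identities behind these bosonic rules are easy to verify numerically. The sketch below assumes $B_{n,m} = \delta_{n,m}/w_n - 1$ from Eq. (4) and $M_p = \sum_n w_n^{1-p}$ (so that $M_0 = 1$ and $M_1 = M$); the bin measures and the test matrix `f` are arbitrary illustrative choices. It checks that a two-leg vertex inside a line can be removed, that a boson line can be traded for "fused minus removed", and that the daisy contribution $\sum_n w_n B_{n,n}^p$ has the binomial expansion in the $M_q$.

```python
from fractions import Fraction
from math import comb

def lego_matrix(w):
    """B[n][m] = delta_{n,m} / w_n - 1."""
    M = len(w)
    return [[(1 / w[n] if n == m else 0) - 1 for m in range(M)] for n in range(M)]

def moment_object(w, p):
    """Assumed definition M_p = sum_n w_n^(1 - p); M_0 = 1 and M_1 = M."""
    return sum((1 / wn) ** (p - 1) for wn in w)

def daisy(w, p):
    """Daisy contribution d_p = sum_n w_n B_{n,n}^p."""
    return sum(wn * (1 / wn - 1) ** p for wn in w)

def daisy_binomial(w, p):
    """Binomial expansion sum_q C(p, q) (-1)^q M_{p-q}."""
    return sum(comb(p, q) * (-1) ** q * moment_object(w, p - q) for q in range(p + 1))

w = [Fraction(1, 2), Fraction(3, 10), Fraction(1, 5)]   # illustrative bin measures
M = len(w)
B = lego_matrix(w)

# Two-leg vertex in a line: sum_n B[a][n] w_n B[n][b] reproduces B[a][b].
BWB = [[sum(B[a][n] * w[n] * B[n][b] for n in range(M)) for b in range(M)]
       for a in range(M)]

# Line contraction: weighting an arbitrary matrix f with B gives
# (fused vertices) minus (line removed).
f = [[n + 2 * m + 1 for m in range(M)] for n in range(M)]
lhs = sum(w[n] * w[m] * f[n][m] * B[n][m] for n in range(M) for m in range(M))
rhs = (sum(w[n] * f[n][n] for n in range(M))
       - sum(w[n] * w[m] * f[n][m] for n in range(M) for m in range(M)))

print(BWB == B, lhs == rhs, [daisy(w, p) == daisy_binomial(w, p) for p in range(1, 5)])
```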



4.2 Extra rule if $h_p = 1$

If $h_p = 1$ $\forall p > 1$, then all kinds of cancellations between diagrams occur, because in
those cases $M_p \sim M^p$ $\forall p > 1$. As a result of this, the contribution of a daisy diagram is
$d_p(M) \sim M^p$, and we can deduce the following rule: the contribution of a diagram that
falls apart into disjoint pieces if a vertex is cut is equal to the product of the contributions
of those disjoint pieces times one plus vanishing corrections. Diagrammatically, the rule
looks like

[diagram: pieces $A$ and $B$ joined at a single vertex] $\sim$ [diagram: $A$] $\times$ [diagram: $B$] . (22)

In [?] we called discrepancies for which Eq. (22) is exact one-vertex decomposable, and have
shown that for those discrepancies only the one-vertex irreducible diagrams contribute, i.e.,
diagrams that do not fall apart into pieces containing bosonic parts if a vertex is cut. The
previous rule tells us that, if $h_p = 1$ $\forall p > 1$, then

$\log G(z) \sim$ sum of all connected one-vertex irreducible diagrams.                          rule 8

The connected one-vertex irreducible diagrams we call relevant and the others irrelevant.
5 Loop analysis



We want to determine the contribution of the diagrams in this section, and in order to do 
that, we need to introduce some notation: 

$L_B$ = the number of boson loops ; (23)

$L_F$ = the number of fermion loops ; (24)

$L$ = the total number of loops ; (25)

$I_B$ = the number of bosonic lines ; (26)

$I_F$ = the number of fermionic lines ; (27)

$v$ = the number of vertices ; (28)

$L_M = L - L_B - L_F$ = the number of mixed loops . (29)

These quantities are in principle functions of the diagrams, but we will never denote this 
dependence explicitly, for it will always be clear which diagram we are referring to when 
we use the quantities. 

With the foregoing, we deduce that the contribution $C_A$ of a connected diagram $A$ with
no external legs satisfies

$$C_A \asymp M^{L_B} N^{L_F} g^{2 I_B} . \tag{30}$$

The Feynman rules and basic graph theory tell us that, for connected diagrams with no
external legs, $v = I_F$ and $L = I_B + I_F - v + 1$, so that

$$I_B = L - 1 = L_B + L_F + L_M - 1 . \tag{31}$$

If we furthermore use that $g = \sqrt{2z/N}$, we find that the contribution is given by

$$C_A \asymp \frac{M^{L_B}}{N^{L_B + L_M - 1}}\, (2z)^{I_B} . \tag{32}$$

Notice that the powers of $M$ and $N$ in this expression do not depend on $L_F$. Furthermore,
it is clear that, for large $M$ and $N$, the largest contribution comes from diagrams with $L_M = 0$.
Moreover, we see that we must have $M = O(N)$, for else the contribution of higher-order diagrams
will grow with the number of boson loops, and the perturbation series becomes completely
senseless. If, however, $N \asymp M$, then the contribution of each diagram with $L_M = 0$ is more
important than the contribution of each of the diagrams with $L_M > 0$. Finally, we also see
that the contribution of the $O(M^{-1})$-corrections of a diagram (Eq. (21)) is always negligible
compared to the leading contribution of each diagram with $L_M = 0$. These observations
lead to the conclusion that, if $N$ and $M$ become large with $N \asymp M$, then the leading
contribution to $\log G(z)$ comes from the diagrams with $L_M = 0$, and that there are no
corrections to these contributions. If we assume that $M/N$ is small, then the importance
of these diagrams decreases with the number of boson loops $L_B$ as $(M/N)^{L_B}$.
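
The bookkeeping of this power counting can be written out as a small helper. This is only a sketch under the assumptions stated in the comments: a diagram contributes $M^{L_B} N^{L_F} g^{2 I_B}$ with $g^2 = 2z/N$ and $I_B = L_B + L_F + L_M - 1$; the function name and the returned dictionary layout are illustrative.

```python
def power_counting(L_B, L_F, L_M):
    """Exponents of M, N and 2z in the contribution of a connected vacuum diagram.

    Assumes the contribution scales as M^L_B * N^L_F * g^(2 I_B), with
    g^2 = 2z / N and I_B = L_B + L_F + L_M - 1 bosonic lines.
    """
    I_B = L_B + L_F + L_M - 1
    return {
        "M": L_B,          # one power of M per boson loop
        "N": L_F - I_B,    # N^L_F from fermion loops, N^(-I_B) from g^(2 I_B)
        "2z": I_B,         # one power of 2z per bosonic line
    }

# The powers of M and N do not change when fermion loops are added ...
print(power_counting(2, 1, 0), power_counting(2, 3, 0))
# ... while every mixed loop costs one extra power of 1/N.
print(power_counting(2, 1, 1))
```

With $L_M = 0$ the exponents combine to $(M/N)^{L_B} N$, which is the statement that the importance of these diagrams decreases as $(M/N)^{L_B}$ when $M/N$ is small.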
5.1 The loop expansion of $\log G(z)$



Now we calculate the first few terms in the loop expansion of $\log G(z)$. We start with the
diagrams with one loop (remember that it is an expansion in boson loops and that we only
have to calculate connected diagrams for $\log G(z)$). The sum of all 1-loop diagrams with
$L_M = 0$ is given by

$$\sum_{p=1}^{\infty} \frac{1}{2p}\, [\text{boson loop with } p \text{ vertices, a fermion loop attached to each vertex}] = -\frac{M-1}{2} \log(1 - 2z) , \tag{33}$$

and this is exactly equal to $\log G_0(z)$ as we know it for the Lego discrepancy (cf. Eq. (3)). The
fractions before the diagrams are the symmetry factors. From now on, we will always write
them down explicitly. To calculate the higher loop diagrams, we introduce the following
effective vertex:

[diagrammatic definition of the effective vertex; not recoverable from the source] (34)



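and the following partly re-summed propagator:

$$[\text{re-summed propagator between } n \text{ and } m] = \sum_{p=0}^{\infty} [\text{boson line between } n \text{ and } m \text{ with } p \text{ fermion-loop insertions}] = \frac{1}{1 - 2z} \times [\text{boson line between } n \text{ and } m] . \tag{35}$$

It is the propagator $\mathcal{G}_{n,m}$ from [?]. The contribution of the 2-loop diagrams with $L_M = 0$
is given by

$$\left[ -\frac{1}{8}\frac{N g^4 M^2}{(1-2z)^2} + \frac{1}{8}\frac{N g^4 M_2}{(1-2z)^2} + \frac{1}{8}\frac{(N g^3)^2 (M_2 - M^2)}{(1-2z)^3} + \frac{1}{12}\frac{(N g^3)^2 M_2}{(1-2z)^3} \right] [1 + O(M^{-1})]$$

$$= \frac{1}{N}\left[ \frac{1}{8}(M_2 - M^2)\,\eta(z)^2 + \left(\frac{5 M_2}{24} - \frac{M^2}{8}\right)\eta(z)^3 \right] [1 + O(M^{-1})] , \tag{36}$$

where each of the four terms on the first line is the contribution of one diagram, including
its symmetry factor, and where we define

$$\eta(z) = \frac{2z}{1 - 2z} . \tag{37}$$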
Notice that the first three diagrams vanish if $h_p = 1$ $\forall p > 1$. The contribution of the
3-loop diagrams with $L_M = 0$ is given by

[sum of the 3-loop diagrams with $L_M = 0$, each multiplied by its symmetry factor; the diagrams are not recoverable from the source] . (38)



If $h_p = 1$ $\forall p > 1$, then only the first four diagrams are relevant, and their contribution $C$
satisfies

$$C \asymp \frac{M^3}{N^2} . \tag{39}$$



6 Various limits 



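In the previous calculations, $M/N$ was the expansion parameter and the expansion of the
generating function only makes sense if it is considered to be small. In fact, a limit in
which $M \to \infty$ does not even exist, because the zeroth order term is proportional to $M$.
In order to analyze limits in which $M$ as well as $N$ go to infinity, we can go over to the
standardized variable $(D_N - E)/\sqrt{V}$ of the discrepancy, where

$$E = \mathsf{E}[D_N] = M - 1 , \tag{40}$$

$$V = \mathsf{V}[D_N] = 2(M - 1) + \frac{M_2 - M^2 - 2(M - 1)}{N} . \tag{41}$$

In terms of the standardized variable, the generating function is given by

$$G(\xi) = \mathsf{E}\left[\exp\left(\xi\, \frac{D_N - E}{\sqrt{V}}\right)\right] = \exp\left(-\frac{E\,\xi}{\sqrt{V}}\right) \mathsf{E}\left[\exp\left(\frac{\xi}{\sqrt{V}}\, D_N\right)\right] . \tag{42}$$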
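
The expectation and variance used for the standardized variable can be checked exactly for small $N$ and $M$ by enumerating all multinomial configurations. The sketch below assumes $E = M - 1$ and $V = 2(M-1) + (M_2 - M^2 - 2(M-1))/N$ with $M_2 = \sum_n 1/w_n$; the bin measures and the value of $N$ are arbitrary illustrative choices, and all function names are hypothetical.

```python
from fractions import Fraction
from math import factorial

def compositions(total, parts):
    """Yield all tuples of `parts` non-negative integers that sum to `total`."""
    if parts == 1:
        yield (total,)
        return
    for first in range(total + 1):
        for rest in compositions(total - first, parts - 1):
            yield (first,) + rest

def multinomial_probability(counts, w):
    """N! * prod_n w_n^{N_n} / N_n! for one configuration of bin counts."""
    prob = Fraction(factorial(sum(counts)))
    for c, wn in zip(counts, w):
        prob *= wn ** c / factorial(c)
    return prob

def chi2(counts, w):
    """Binned statistic sum_n (N_n - w_n N)^2 / (w_n N)."""
    N = sum(counts)
    return sum((c - wn * N) ** 2 / (wn * N) for c, wn in zip(counts, w))

def exact_mean_and_variance(w, N):
    """Mean and variance of the statistic by exhaustive enumeration."""
    mean = second = Fraction(0)
    for counts in compositions(N, len(w)):
        p = multinomial_probability(counts, w)
        x = chi2(counts, w)
        mean += p * x
        second += p * x * x
    return mean, second - mean ** 2

def formula_mean_and_variance(w, N):
    """Closed forms assumed in the text: E = M - 1 and V as quoted above."""
    M = len(w)
    M2 = sum(1 / wn for wn in w)
    return Fraction(M - 1), 2 * (M - 1) + (M2 - M * M - 2 * (M - 1)) / N

w = [Fraction(1, 2), Fraction(3, 10), Fraction(1, 5)]   # illustrative bin measures
print(exact_mean_and_variance(w, 5))
print(formula_mean_and_variance(w, 5))
```

The enumeration grows quickly with $N$ and $M$, so this is meant as a consistency check on small cases, not as a way to study the limits discussed in this section.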
Instead of the variable $z$, the variable $\xi = z\sqrt{V}$ is considered to be of $O(1)$ in this
perspective, and the contribution of a diagram changes from (30) to

$$C_A \asymp M^{L_B} N^{L_F} \left(\frac{2\xi}{N\sqrt{V}}\right)^{I_B} . \tag{43}$$



In the following we will investigate limits of $M \to \infty$ with, at first instance, the criterion
of Eq. (10) as only restriction. The fact that the variance $V$ shows up explicitly in the
contribution of the diagrams forces us to specify the behavior of $M_2$ more precisely. We
will take

$$M_2 - M^2 \asymp M^{\gamma} , \qquad 0 \leq \gamma \leq 2 . \tag{44}$$

Notice that $h_2 = 1$ if $\gamma < 2$ and that $h_2$ does not exist if $\gamma > 2$. Furthermore, we cannot
read off the natural expansion parameter from the contribution of the diagrams anymore,
and have to specify the behavior of $N$. We will only consider limits in which

$$N \asymp M^{\alpha} , \qquad \alpha \geq 0 . \tag{45}$$

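Although they are a small subset of possible limits, those that can be specified by a pair
$(\alpha, \gamma)$ show an interesting picture. We will derive the results in the next section, but
present them now in the following phase diagram:

[Figure: phase diagram in the $(\alpha, \gamma)$-plane, with $\alpha$ on the horizontal axis and $\gamma$ on the vertical axis, showing the regions $T$ and $U$.]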

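It shows the region $S = \{(\alpha, \gamma) \in \mathbb{R}^2 \mid \alpha \in [0, \infty),\ \gamma \in [0, 2]\}$ of the real $(\alpha, \gamma)$-plane. In
this region, there is a critical line $\ell$, given by

$$\ell = \{(\alpha(t), t) \in S \mid t \in [0, 2]\} \quad \text{with} \quad \alpha(t) = \begin{cases} \frac{1}{2} & \text{if } 0 \leq t \leq \frac{3}{2} , \\ 2 - t & \text{if } \frac{3}{2} \leq t \leq 2 . \end{cases} \tag{46}$$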



It separates $S$ into two regions $T$ and $U$, neither of which contains $\ell$. Our results are the
following. Firstly,

in the region $T$, the limit of $M \to \infty$ is not defined. (47)
In this region, the standardized variable is not appropriate, and we see that there are too
many diagrams that grow indefinitely with $M$. Secondly,

in the region $U$, the limit of $M \to \infty$ gives a Gaussian distribution. (48)

Because we used the standardized variable, this distribution necessarily has zero expectation
and unit variance. Finally,

on the line $\ell$, various limits exist, depending on the behaviour of $M_p$, $p > 2$. (49)

One of these limits we were able to calculate explicitly. It appears if $M_p - M^p \prec M^{p - \frac{1}{2}}$
$\forall p > 1$, which is, for example, satisfied in the case of equal binning. In this limit, the
generating function is given by

$$\log G(\xi) = \frac{1}{\lambda^2}\left(e^{\lambda\xi} - 1 - \lambda\xi\right) , \qquad \lambda = \lim_{M\to\infty} \frac{\sqrt{2M}}{N} . \tag{50}$$

In Appendix A, we show that the probability distribution $H$ belonging to this generating
function, which is the inverse Laplace transform, is given by

$$H(\tau) = \sum_{n \in \mathbb{N}} \delta\!\left(\tau - \left[n\lambda - \frac{1}{\lambda}\right]\right) \frac{1}{n!} \left(\frac{1}{\lambda^2}\right)^{n} e^{-1/\lambda^2} . \tag{51}$$

It consists of an infinite number of Dirac delta-distributions, weighted with a Poisson
distribution. The delta-distributions reveal the fact that, for finite $N$ and $M$, the Lego
discrepancy, and also the $\chi^2$-statistic, can only take a finite number of values, so that the
probability density should consist of a sum of delta-distributions. In the usual limit of
$N \to \infty$, the discrete nature of the random variable disappears, and the $\chi^2$-distribution is
obtained. In our limit, however, the discrete nature does not yet disappear. A continuous
distribution is obtained if $\lambda \to 0$, which corresponds with going over from $\alpha = \frac{1}{2}$ to $\alpha > \frac{1}{2}$.

Then $G(\xi) \to \exp(\frac{1}{2}\xi^2)$.
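
The relation between the generating function of this limit and a sum of Poisson-weighted delta-distributions can be illustrated numerically. The sketch below assumes that $\log G(\xi) = (e^{\lambda\xi} - 1 - \lambda\xi)/\lambda^2$ and that the distribution puts weight $e^{-\mu}\mu^n/n!$, with $\mu = 1/\lambda^2$, on the points $n\lambda - 1/\lambda$; the truncation `n_max` and the sample values of $\lambda$ and $\xi$ are illustrative choices.

```python
from math import exp, factorial, log

def log_G_limit(xi, lam):
    """Assumed limiting form: log G(xi) = (exp(lam*xi) - 1 - lam*xi) / lam^2."""
    return (exp(lam * xi) - 1.0 - lam * xi) / lam ** 2

def poisson_weight(n, lam):
    """Weight of the n-th delta peak: Poisson distribution with mean 1/lam^2."""
    mu = 1.0 / lam ** 2
    return exp(-mu) * mu ** n / factorial(n)

def peak_position(n, lam):
    """Position of the n-th delta peak: n*lam - 1/lam."""
    return n * lam - 1.0 / lam

def log_G_from_peaks(xi, lam, n_max=80):
    """Log of the moment generating function of the truncated sum of peaks."""
    return log(sum(exp(xi * peak_position(n, lam)) * poisson_weight(n, lam)
                   for n in range(n_max)))

lam, xi = 0.5, 0.7
print(log_G_limit(xi, lam), log_G_from_peaks(xi, lam))

# Zero expectation and unit variance of the standardized variable.
mean = sum(peak_position(n, lam) * poisson_weight(n, lam) for n in range(80))
var = sum(peak_position(n, lam) ** 2 * poisson_weight(n, lam) for n in range(80))
print(mean, var)

# For small lam the quadratic (Gaussian) term xi^2 / 2 dominates.
print(log_G_limit(xi, 1e-3), xi ** 2 / 2)
```

In other words, under these assumptions the limiting variable is a Poisson count with mean $1/\lambda^2$, rescaled by $\lambda$ and shifted by $-1/\lambda$, which is consistent with the discrete nature described in the text.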

7 Derivation of the various limits 

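We will treat the cases $\gamma = 2$ and $\gamma < 2$ independently.
7.1 $\gamma = 2$

We distinguish the three cases $0 < \alpha < 1$, $\alpha = 1$ and $\alpha > 1$.

If $\alpha > 1$, then $V \asymp M$, and the contribution $C_A$ of a diagram $A$ satisfies $C_A \asymp M^{\beta}$,
with

$$\beta = (\tfrac{1}{2} - \alpha) L_B - \tfrac{1}{2} L_F + (\alpha + \tfrac{1}{2})(1 - L_M) . \tag{52}$$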
A short analysis shows that only diagrams with $(L_B, L_F, L_M) = (1, 1, 0)$ or $(L_B, L_F, L_M) =
(1, 2, 0)$ give a non-vanishing contribution, and those diagrams are

$$\frac{1}{2}\,[\text{boson loop and fermion loop attached to one vertex}] = \frac{1}{2} N (M - 1) g^2 = \frac{E\,\xi}{\sqrt{V}} , \tag{53}$$

$$\frac{1}{4}\,[\text{boson loop with two vertices, a fermion loop attached to each}] = \frac{1}{4} N^2 (M - 1) g^4 = \frac{(M - 1)\,\xi^2}{V} = \frac{\xi^2}{2} + O(M^{1-\alpha}) . \tag{54}$$

The first diagram gives a contribution that is linear in $\xi$ and cancels with the exponent in
Eq. (42). This has to happen for every value of $\alpha$, and as we will see, this diagram will
always occur. Notice that the diagrams above are the first two diagrams in the series on
the l.h.s. of Eq. (33). The logarithm of the generating function becomes quadratic, so that
the probability distribution becomes Gaussian.
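
The "short analysis" of which loop numbers survive can be mimicked by scanning the exponent of $M$ over a grid of loop numbers. The sketch below assumes that a diagram scales as $M^{L_B} N^{L_F} (N\sqrt{V})^{-I_B}$ with $I_B = L_B + L_F + L_M - 1$, $N \asymp M^{\alpha}$ and $V \asymp M$, and that every diagram has at least one boson loop and one fermion loop; the function names and the grid size are illustrative.

```python
from fractions import Fraction

def beta(alpha, L_B, L_F, L_M):
    """Exponent of M in the contribution of a diagram, for V ~ M and N ~ M^alpha."""
    alpha = Fraction(alpha)
    half = Fraction(1, 2)
    return (half - alpha) * L_B - half * L_F + (alpha + half) * (1 - L_M)

def non_vanishing(alpha, max_loops=6):
    """Loop triples (L_B, L_F, L_M) on a finite grid whose exponent is not negative."""
    found = []
    for L_B in range(1, max_loops + 1):
        for L_F in range(1, max_loops + 1):
            for L_M in range(0, max_loops + 1):
                if beta(alpha, L_B, L_F, L_M) >= 0:
                    found.append((L_B, L_F, L_M))
    return found

print(non_vanishing(2))   # an example with alpha > 1
print(non_vanishing(1))   # alpha = 1
```

For $\alpha = \frac{1}{2}$ the exponent no longer depends on $L_B$, so the list returned by `non_vanishing(Fraction(1, 2))` keeps growing with `max_loops`; a finite scan of this kind is then no substitute for the analysis in the text.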

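If $\alpha = 1$, then again $V \asymp M$, so that $\beta = -\frac{1}{2}(L_B + L_F) + \frac{3}{2}(1 - L_M)$, and we have to
add the diagrams with $(L_B, L_F, L_M) = (2, 1, 0)$:

$$\frac{1}{8}\,[\ldots] + \frac{1}{8}\,[\ldots] \sim \frac{1}{8}\, N (M_2 - M^2)\, \frac{4\xi^2}{N^2 V} = \frac{(M_2 - M^2)\,\xi^2}{2 N V} , \tag{55}$$

where $[\ldots]$ stands for the two 2-loop diagrams with a single fermion loop.

If $0 < \alpha < 1$, then $V \asymp M^{2-\alpha}$ and $\beta = -\frac{\alpha}{2} L_B - (1 - \frac{\alpha}{2}) L_F - (\frac{\alpha}{2} + 1) L_M + \frac{\alpha}{2} + 1$, so
that, besides the diagram of Eq. (53), only the diagrams of Eq. (55) give a non-vanishing
contribution, and this contribution is equal to $\xi^2/2$.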

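7.2 $0 \leq \gamma < 2$

We can distinguish the two cases $\gamma - \alpha < 1$ and $\gamma - \alpha > 1$.
7.2.1 $\gamma - \alpha < 1$

In this case, $V \asymp M$, and the contribution $C_A$ of a diagram $A$ satisfies $C_A \asymp M^{\beta}$ with

$$\beta = (\tfrac{1}{2} - \alpha) L_B - \tfrac{1}{2} L_F + (\alpha + \tfrac{1}{2})(1 - L_M) . \tag{56}$$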

If $\alpha < \frac{1}{2}$, then $\beta$ increases with the number of boson loops $L_B$, and we are not able to
calculate the limit of $M \to \infty$.

If $\alpha > \frac{1}{2}$, then the only diagrams that have a non-vanishing contribution are those with
$(L_B, L_F, L_M) = (1, 1, 0)$, $(1, 2, 0)$ or $(2, 1, 0)$. These are exactly the diagrams of Eq. (53),
Eq. (54) and Eq. (55). Notice, however, that the diagrams of Eq. (55) cancel if $\gamma - \alpha < 0$:
then they are irrelevant. The resulting asymptotic distribution is Gaussian again.

If $\alpha = \frac{1}{2}$, then $L_B$ disappears from the equation for $\beta$, and we obtain a non-Gaussian
asymptotic distribution. The diagrams that contribute are those with $(L_F, L_M) = (1, 0)$
or $(2, 0)$. There is, however, only one relevant diagram with $(L_F, L_M) = (1, 0)$, namely
the diagram of Eq. (53) that gives the linear term. We have to be careful here, because
the other diagrams with $(L_F, L_M) = (1, 0)$ still might be non-vanishing. A short analysis
shows that they are given by the sum of all ways to put daisy diagrams to one fermion
loop, and that their contribution is given by

$$C_1(M) = N \log\left(1 + \sum_{p=1}^{\infty} \frac{g^{2p}}{2^p\, p!}\, d_p(M)\right) . \tag{57}$$

We know that, if $h_p = 1$, then $d_p(M) = M^p[1 + \varepsilon_p(M)]$ with $\lim_{M\to\infty} \varepsilon_p(M) = 0$, so that

$$C_1(M) = \tfrac{1}{2} N M g^2 + N \log\left(1 + e^{-\frac{1}{2} M g^2} \sum_{p=1}^{\infty} \frac{(M g^2)^p\, \varepsilon_p(M)}{2^p\, p!}\right) . \tag{58}$$

The first term gives the leading contribution; it is the contribution of the relevant diagram,
which consists of a boson loop and a fermion loop attached to one vertex. The second
term is irrelevant with respect to the first, but can still be non-vanishing, depending on the
behavior of $\varepsilon_p(M)$. Remember that $\alpha = \frac{1}{2}$ and $V \asymp M$, so that $M g^2 = 2\xi M/(N\sqrt{V}) \to$
constant, and we can see that the contribution is only vanishing if

$$\lim_{M\to\infty} N \varepsilon_p(M) = 0 \quad \forall p > 1 \iff M_p - M^p \prec M^{p - \frac{1}{2}} \quad \forall p > 1 . \tag{59}$$

For $p = 1$ this relation is satisfied because $\varepsilon_1(M) = 0$. For $p = 2$ this relation is also
satisfied if $\gamma < \frac{3}{2}$.

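If the relation is also satisfied for the other values of $p$, then the only diagrams that
contribute to the generating function are the relevant diagrams with $(L_F, L_M) = (2, 0)$:

[sum of diagrams in which two fermion loops, each carrying one effective vertex, are connected by $2, 3, 4, \ldots$ boson lines] , (60)

where we used the effective vertex (34) again. The contribution of a diagram of this type
with $p$ boson lines is, to leading order, given by

$$\frac{1}{2\, p!}\, N^2 M^{p-1} g^{2p} . \tag{61}$$

The factor $1/(2\, p!)$ is the symmetry factor of this type of diagram. If we sum the contributions
of these diagrams and use that $V \sim 2M$, we obtain

$$\log G(\xi) \sim \frac{1}{\lambda^2}\left(e^{\lambda\xi} - 1 - \lambda\xi\right) , \qquad \lambda = \lim_{M\to\infty} \frac{\sqrt{2M}}{N} . \tag{62}$$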
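7.2.2 $\gamma - \alpha > 1$

In this case, $V \asymp M^{\gamma - \alpha}$ and the contribution $C_A$ of a diagram $A$ satisfies $C_A \asymp M^{\beta}$ with

$$\beta = \left(1 - \frac{\gamma + \alpha}{2}\right) L_B - \frac{\gamma - \alpha}{2}\, L_F + \frac{\gamma + \alpha}{2}\,(1 - L_M) . \tag{63}$$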
If $\gamma + \alpha < 2$, then $\beta$ increases with the number of boson loops $L_B$, and we are not able
to calculate the limit of $M \to \infty$.

If $\gamma + \alpha > 2$, then the only diagrams that have a non-vanishing contribution are
those with $(L_B, L_F, L_M) = (1, 1, 0)$, $(1, 2, 0)$ or $(2, 1, 0)$. These are exactly the diagrams of
Eq. (53), Eq. (54) and Eq. (55). Notice, however, that the diagrams of Eq. (55) cancel if
$\gamma - \alpha < 0$: then they are irrelevant. The resulting asymptotic distribution is Gaussian.

If $\gamma + \alpha = 2$, then $\beta = (\alpha - 1) L_F + 1 - L_M$. Because $\gamma - \alpha > 1$, we have $\alpha < \frac{1}{2}$, and
non-vanishing diagrams have $(L_F, L_M) = (1, 0)$. Their contribution is given by the r.h.s.
of Eq. (58), the first term of which gives the term linear in $\xi$. The second term is
non-vanishing, because $M g^2 \asymp M^{(2 - \gamma - \alpha)/2} \to$ constant and $N \varepsilon_2(M) \asymp M^{\alpha - 2 + \gamma} \to$ constant.

8 Conclusions 

We have shown that the Lego discrepancy with $M$ bins is equivalent to a $\chi^2$-statistic with
$M$ bins. We have presented a procedure to calculate the moment generating function of
the probability distribution of the discrepancy perturbatively if $M$ and $N$, the number
of uniformly and randomly distributed data points, become large. The natural expansion
parameter we have identified to be $M/N$, and we have calculated the first few terms in the
series explicitly.

In order to calculate limits in which $N, M \to \infty$, we have introduced the objects of
Eq. (9) and restricted the behavior of the size of the bins such that they satisfy Eq. (10).
Furthermore, we have gone over to the standardized variable of the discrepancy. For this
variable, we have derived a phase diagram, representing the limits specified by Eq. (44)
and Eq. (45). We have formulated the results in (47), (48) and (49).

One of these results is that there are non-trivial limits if $N, M \to \infty$ such that $M^{\alpha}/N \to$
constant with $\alpha < 1$. This result is in stark contrast with the rule of thumb that, in order
to trust the $\chi^2$-distribution, each bin has to contain at least a few data points.

Appendix A 

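We want to calculate the integral

$$H(\tau) = \frac{1}{2\pi i} \int_{-i\infty}^{i\infty} \exp\left(\frac{1}{\lambda^2}\left(e^{\lambda z} - 1 - \lambda z\right) - z\tau\right) dz . \tag{64}$$

We will make use of the fact that

$$\exp\left(\lambda\left(z + \frac{2\pi i\, n}{\lambda}\right)\right) = e^{\lambda z} \tag{65}$$

for all $n \in \mathbb{Z}$, so that

$$H(\tau) = \frac{1}{2\pi i} \sum_{n \in \mathbb{Z}} \int_{(2n-1)\pi i/\lambda}^{(2n+1)\pi i/\lambda} \exp\left(\frac{1}{\lambda^2}\left(e^{\lambda z} - 1 - \lambda z\right) - z\tau\right) dz$$

$$= \frac{1}{2\pi i} \sum_{n \in \mathbb{Z}} \exp\left(-2\pi i\, n\, \frac{1 + \lambda\tau}{\lambda^2}\right) \int_{-\pi i/\lambda}^{\pi i/\lambda} \exp\left(\frac{1}{\lambda^2}\left(e^{\lambda z} - 1 - \lambda z\right) - z\tau\right) dz . \tag{66}$$

Notice that the integral is independent of $n$, so that the sum can be interpreted as a sum
of Dirac delta-distributions:

$$\sum_{n \in \mathbb{Z}} \exp\left(-2\pi i\, n\, \frac{1 + \lambda\tau}{\lambda^2}\right) = \sum_{n \in \mathbb{Z}} \delta\left(\frac{1 + \lambda\tau}{\lambda^2} - n\right) = \lambda \sum_{n \in \mathbb{Z}} \delta\left(\tau - \left[n\lambda - \frac{1}{\lambda}\right]\right) . \tag{67}$$

These delta-distributions restrict the values that $\tau$ can take. If we use these restrictions and
do the appropriate variable substitutions, the remaining integral in (66) can be reduced to

$$\frac{i\, e^{-1/\lambda^2}}{\lambda} \int_{-\pi}^{\pi} \exp\left(\frac{e^{i\varphi}}{\lambda^2} - i n \varphi\right) d\varphi = \frac{e^{-1/\lambda^2}}{\lambda} \oint \exp\left(\frac{w}{\lambda^2}\right) \frac{dw}{w^{n+1}} , \tag{68}$$

where $n \in \mathbb{Z}$ and the contour is closed around $w = 0$. According to Cauchy's theorem,
the final integral is only non-zero if $n \in \mathbb{N}$, and in that case its value is $2\pi i\, \frac{1}{n!} \left(\frac{1}{\lambda^2}\right)^{n}$. The
combination of these results gives Eq. (51).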